Detection of Baby Voice and its Application Using Speech Recognition System and Fundamental Frequency Analysis
نویسندگان
چکیده
We propose a method for detecting a baby voice using a speech recognition system and fundamental frequency analysis. We propose the following two conditions for recognizing a sound form segment of a baby voice. Condition 1: The word reliability for a sound form segment obtained by using Julius is under a threshold, Condition 2: For a certain time period, the fundamental frequency of the sound form segment changes by another threshold or over. When at least one of the above two conditions is met, the sound form segment is judged as coming from a baby voice. We successfully applied the proposed method to pattern recognition of a baby’s emotion. Key-Words: Baby Voice, Speech Recognition System, Fundamental Frequency, Emotion, Pattern Recognition, Baby Care Support
منابع مشابه
Voice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملبررسی برخی ویژگی های آکوستیک گفتار نوزاد مدار در مادران فارسی زبان
Introduction: When adults talk to another person, linguistic characteristics of the listener will also be considered. A clear example of speech changes depending on the listener is maternal or infant directed speech. Infant directed speech is more slowly with longer sentences and pauses at the end of the utterance. Undoubtedly the most distinctive feature of this style of speech is acoustic c...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملبررسی ویژگیهای آکوستیکی مربوط به کنترل حرکتی گفتار در کودکان لکنتی و غیرلکنتی
Objective Stuttering is a developmental disorder of speech fluency with unknown causes. One of the proposed theories in this field is deficits in speech motor control that is associated with damaged control, timing, and coordination of the speech muscles. Fundamental frequency, fundamental frequency range, intensity, intensity range, and voice onset time are the most important acoustic componen...
متن کاملThe Study of Vocal Function in Patients With Early Laryngeal Carcinoma After Transoral Laser Microsurgery
Objective Today transoral laser microsurgery is considered as one of the first options to control early laryngeal cancer, and voice disorder is one of the inevitable complications of this therapeutic component. This study aimed to compare the vocal function in patients with early-stage laryngeal cancer following laser surgery with healthy individuals with normal voice quality using acoustic ana...
متن کامل